Endogeneity in High Dimensions.
نویسندگان
چکیده
Most papers on high-dimensional statistics are based on the assumption that none of the regressors are correlated with the regression error, namely, they are exogenous. Yet, endogeneity can arise incidentally from a large pool of regressors in a high-dimensional regression. This causes the inconsistency of the penalized least-squares method and possible false scientific discoveries. A necessary condition for model selection consistency of a general class of penalized regression methods is given, which allows us to prove formally the inconsistency claim. To cope with the incidental endogeneity, we construct a novel penalized focused generalized method of moments (FGMM) criterion function. The FGMM effectively achieves the dimension reduction and applies the instrumental variable methods. We show that it possesses the oracle property even in the presence of endogenous predictors, and that the solution is also near global minimum under the over-identification assumption. Finally, we also show how the semi-parametric efficiency of estimation can be achieved via a two-step approach.
منابع مشابه
Testing Endogeneity with High Dimensional Covariates∗
Modern, high dimensional data has renewed investigation on instrumental variables (IV) analysis, primary focusing on estimation of the included endogenous variable under sparsity and little attention towards specification tests. This paper studies in high dimensions the Durbin-Wu-Hausman (DWH) test, a popular specification test for endogeneity in IV regression. We show, surprisingly, that the D...
متن کاملTesting Endogeneity with Possibly Invalid Instruments and High Dimensional Covariates
The Durbin-Wu-Hausman (DWH) test is a commonly used test for endogeneity in instrumental variables (IV) regression. Unfortunately, the DWH test depends, among other things, on assuming all the instruments are valid, a rarity in practice. In this paper, we show that the DWH test often has distorted size even if one IV is invalid. Also, the DWH test may have low power when many, possibly high dim...
متن کاملWhy do socially responsible firms pay more dividends?
Using a sample of 22,389 US firm-year observations over the period from 1991 to 2012, we find that high CSR firms pay more dividends than low CSR firms. This is consistent with our expectation that socially responsible firms may use the dividend policy to manage the agency problems related to overinvestment in CSR. The analysis of individual components of CSR provides strong support for this ma...
متن کاملDoes Higher Hospital Cost Imply Higher Quality of Care?
—This study investigates whether higher input use per stay in the hospital (treatment intensity) and longer length of stay improve outcomes of care. We allow for endogeneity of intensity and length of stay by estimating a quasi-maximum-likelihood discrete factor model, where the distribution of the unmeasured variable is modeled using a discrete distribution. Data on elderly persons come from s...
متن کاملSkill Wage Premia, Employment, and Cohort Effects in a Model of German Labor Demand
This paper studies the relationship between employment and wage structures in West Germany based on the IAB employment subsample 1975–1997. It extends the analytical framework of Card and Lemieux (2001) which simultaneously includes skill and age as important dimensions of heterogeneity. After having identified cohort effects in skill wage premia and in the evolution of relative employment meas...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Annals of statistics
دوره 42 3 شماره
صفحات -
تاریخ انتشار 2014